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Abstract 

We present a linear programming based algorithm for computing a spanning tree T 
of a set P of n points in IR rf , such that its crossing number is 0(min(i log n, n 1_1 / rf )), 
where t the minimum crossing number of any spanning tree of P. This is the first guar- 
anteed approximation algorithm for this problem. We provide a similar approximation 
algorithm for the more general settings of building a spanning tree for a set system 
with bounded VC dimension. 

Our approach is an alternative to the reweighting technique previously used in 
computing such spanning trees. 

1 Introduction 

The reweighting technique is a powerful tool in computer science |AHK06| . In Computational 
Geometry, it was introduced by Chazelle and Welzl |CW89| who used it to compute spanning 
paths with low crossing number in set systems with bounded VC dimension. Welzl |Wel92| 
provided a tighter analysis for the case of spanning tree of points in R d . Matousek |Mat92| 
used the reweighting technique to provide a powerful partition theorem that proved to be very 
useful in building range searching data-structures |AE98| . Also, Clarkson |Cla93| provided 
an algorithm for polytope approximation that used the reweighting technique. Bronnimann 
and Goodrich |BG95| realized that Clarkson's algorithm implies a general method for solving 
hitting set and set cover problems in geometric settings. 

Interestingly, Long |Lon01| had observed that set cover problems in geometric settings 
can be solved by using LP and taking a random sample (guided by the LP solution) that is an 
e-net (a similar observation was later made by |ERS05| ). In fact, such packing/covering LPs 
can be solved efficiently via reweighting |PST91| . Thus, one can interpret the reweighting 
algorithm for solving the geometric set cover problem as directly solving the associated LP. 
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The result of Welzl |Wel92]. mentioned above, is quite in- 
triguing. It shows that for a set P of n points in the plane 
(resp., in R d ) one can find a spanning tree of the points, such 
that any line (resp., hyperplane) crosses at most 0(y/n) (resp., 
C^n 1-1 ^)) edges (i.e., segments) of the spanning tree. To ap- 
preciate this result, consider the point set formed by the grid 
y/n x ^fn. It is trivial in this case to come up with a spanning 
tree with a crossing number 0(yfn) - any spanning tree of the 
grid points using only edges of the grid has this property, see 
figure on the right. Surprisingly, the result of Welzl |Wel92 




implies that any point set behaves like a grid point set as far as the crossing number of the 
optimal spanning tree. 

In this work, we establish a connection between computing spanning trees with low 
crossing number and LPs (i.e., linear programs); that is, we show that spanning trees with 
low crossing number can be computed using LP rounding. 

Approximate spanning tree with the lowest crossing number. Given a set system 
X = (P, 5F) of finite VC dimension r, we show how to compute, in polynomial tim^l, a 
spanning tree of P with crossing number 0(t\ogn) (assuming t = Q(\ogn)), where t is the 
minimal crossing number of any spanning tree of P. This is done by recursively solving a 
LP relaxation and rounding it. See Section [2] for details. 

Naturally, this algorithm also applies to the Euclidean case. Specially, given a set P of 
n points in R d one can compute, in polynomial time, a spanning tree T such that every 
hyperplane crosses at most 0(t log n) edges of T, where t is the minimum crossing number 
of any spanning tree of P. 

Surprisingly, this is the first guaranteed approximation algorithm known for this problem. 
In particular, achieving such an approximation is mentioned as open in the Open Problems 



Project (see http : //maven . smith . edu/~orourke/T0PP/P20 . html#Problem . 20[) . 



Spanning trees in R d with 0(n l ~ l l d ) crossings. We also modify the analysis of our 
algorithm (but not the algorithm itself) so that it yields worst case bound on the crossing 
number. Specifically, we get a polynomial time algorithm that, for a given set P of n points 
in R d , computes a spanning tree T of P such that any hyperplane in R d crosses at most 
0(77. 1-1//d ) edges of T. Our proof of the correctness of the algorithm is self contained (except 
for a relatively easy lemma, see Lemma IX2l) . and uses LP duality. We believe the new proof 
provides a new insight into why such trees exist. In particular, Chazelle and Welzl |CW89| 
and Welzl |Wel92| proofs of the existence of such spanning trees are simple but somewhat 
"mysterious" (at least for the author, but other people might not see the mystery). 

Here is a sketch of the resulting argument why such trees exist: In the plane, it is sufficient 
to find spanning forest that span at least fl(n) vertices of P (in connected components that 
are not singletons) and has crossing number t. One can find such a (fractional) spanning 
graph by doing LP relaxation. The dual LP then asks (intuitively) to separate the given n 
points into singletons by a set of lines of minimum cardinality (i.e., for any p, q G P, there 
exists a selected line that crosses the segment pq). It is not hard to show that any such set 



1 We make the standard assumption that solving a LP of polynomial size takes polynomial time. 
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of lines need to be of size Q(y/n). A somewhat more involved argument (since we are dealing 
with a fractional solution of an LP that has some other constraints) implies that the dual 
LP is feasible for t = y/n and its optimal solution is bounded from below. It follows that 
the primal LP is feasible. Now , solving the primal LP and using a straightforward rounding 
implies that one can compute the required spanning graph. Applying this recursively by 
selecting a vertex from each connected component, and overlaying the resulting spanning 
graphs together results in a connected graph of P with crossing number 0(y/n). See Section [3] 
for details. 

Interestingly, while the above algorithm works for any point set in IR d , one can do slightly 
better in the planar case, and get a deterministic rounding scheme, see Section [37TI for details. 



Previous work. Fekete et al. |FLM08| suggested using LP relaxation to compute a span- 
ning tree with low crossing number. Their LP is considerably more elaborate than ours 
(considering all cuts), and their iterated rounding scheme seems to perform quite well in 
practice (although they are unable to provide a theoretical guarantee on the performance). 
Furthermore, they prove that computing the spanning tree with minimal crossing number is 
NP-HARD. 



Organization. In Section[2]we show the 0(t\ogn) approximation algorithm for spanning 
trees with low crossing number, for the general case of a set system with low VC dimension. 
In Section [3l we specialize this algorithm for the case of points and hyperplanes in H d . We 
discuss our results and some related open problems in Section [U 



2 Approximating the spanning tree with optimal crossing 
number 

Consider a set system X = (P, of finite VC dimension r. For more details on spaces with 
bounded VC dimension see |PA95| . For our purposes, it is sufficient that 5F is a set of subsets 
of P of cardinality bounded by |P| r , and this holds for any set system induced by a subset 
of P. 

For two distinct points p, q G P, we will refer to the set {p, q} as an edge, denoted by 
pq. An edge pq crosses a set S G 5" if |{p, q} fl S*| = 1. 

The crossing number of a set of edges F of P is the maximum number of edges of F 
crossed by any set of 3\ 

Example 2.1 As a concrete example of such a set system, consider a set P of n points in 
the plane, and let 



h + is a halfplane j . 



The set system X = (P, 5F) in this case has VC dimension 3, and a spanning tree T of P with 
crossing number t, is a spanning tree of P, drawn in the plane by straight segments, such 
that every line (i.e., the boundary of a halfplane) intersects at most t edges of T. 
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Lemma 2.2 Assume there exists a spanning tree T for P with crossing number t. Then, 
for any subset X C P there exists a spanning tree with crossing number at most 2t. 



Proof: Convert the spanning tree T of P, with crossing number t, into a closed cycle C 
visiting the points of P, by doing an Euler tour of T, using each edge of T twice. The new 
cycle C has crossing number 2t. Next, shortcut the cycle C such that it uses only elements 
of X, by replacing each subpath ir xy (that uses inner vertices that are not in X) connecting 
x, y G X by the edge xy. 

This results in a cycle that visits only the vertices of X and has crossing number < 2t, 
as such a shortcutting can only decrease the number of edges crossing a set S G 3\ Indeed, 
consider a subpath xiX2,X2X%, . . . ,x m -ix m , and observe that if x\x m crosses a set S, then 
one of the edges in the path must also cross this set. Thus, replacing a subpath (of a cycle) 
by an edge reduces the crossing number of the cycle. ■ 

Let E (P) denote the set of all edges of P, and consider the following LP (parameterized 
by t). 

7(P,t) = max £ y pq (1) 

pqGE(P) 

s.t. £ Vp^<t VjS G "J 

pqGE(P),|pqnS'|=l 

£ *pq>l V P GP (*) 

qeP.q^p 

y pq >0 VpqGE(P). 

Intuitively, this LP tries to pick as many edges as possible (i.e., y pq = 1 indicates that 
we pick the edge pq), such that (i) no set is being crossed more than t times, and (ii) every 
point of P participates in at least one edge that is being picked. 

Remark 2.3 The above LP might not be feasible, and in such a case j(P,t) is not defined. 
Naturally, one can modify this LP to first compute the minimal value t for which it is feasible, 
and then solve the original LP with this value of t. 

In particular, in this case, we can choose almost any arbitrary target function to optimize 
the LP for. We had chosen this one since the dual form is convenient to work with, see 
Section [31 

Lemma 2.4 Consider a set system X = (P, 5F) with bounded VC dimension, where n = 
\P\, and let t be a parameter such that y(P,t) is feasible. Then, one can compute (in 
polynomial time) a set of edges F, such that E[|P|] = y(P,t), and the number of connected 
components of the graph (P, F) is (in expectation) at most (9/10)n. The crossing number of 
F is 0(t + log n/ log log n) with high probability. 

Proof: We solve the LP (pp) and compute j(P,t). Next, for every pq G E(P), if y pq > 1 then 
we add pq to F. Otherwise, if y pq < 1 then we pick the edge pq into F with probability y pq . 
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For a set S G 5F, let X s be its crossing number in P. We have that 

H = e[X s ] < ^ y pq < t. 

pqeE(P), 
|pqnS| = l 

/ log Tl 

For a constant c > sufficiently large, let 5 = c + c (logn) / 1 t log ) , and observe that 



t 

tS log 5 = fi(logn). As such, by the Chernoff inequality, we have that 



Pr[X s > (1 + 5)t] < Pr 



X 5 >(l + ^]u 



< 



exp(t5/ n) 

{i + td/ij 1 y+ ts /^ 



exp ( tS - (fi + tS) ln( 1 + — J J < exp(t5 - i<5 log 5) < 



/i j j n°W ' 

Since X has VC dimension r, the number of sets one has to consider (i.e., the size of 3) is 
0(n T ) |PA95| . which implies, by the above, that the crossing number of F is bounded by 
(1 + 5)t = 0(t + logn/ log logn), with high probability. 

As for the number of connected components in the graph G = (P,F), observe that a 
point p is not adjacent to any edge of F with probability 

JJ (1 - j/pq) < exp I - fpq ) ^ -> 

q^p 

by the inequality (*) in LP (JTJ). Let Y be the number of points of P that are singletons in 
the graph (P,F). By the above, we have that E[^] < n/e. As such, the expected number 
of connected components in (P, F) is 



< E 



n n 9 
< - + - < — n. 
~ 2 e ~ 10 



Theorem 2.5 Consider a set systemX = (P, 3") with bounded VC dimension, where n = |P|. 
Let t be the minimum crossing number of any spanning tree of I. Then, one can compute, in 
polynomial time, a spanning tree T of P with a crossing number 0(t logn + log 2 n/ log logn). 

Proof: We set P\ = P. In the ith iteration, we compute the minimal ti for which 7(Pj,^) 
is feasible. Next, we compute a set of edges Pj over p, using Lemma [2T4l If the number 
of connected components of (Pj,Pj) is larger than (19/20) |p|, we repeat this iteration (we 
have constant probability to succeed by Markov's inequality). Next, from each connected 
component of Pj, we pick one point into Pj+i. We repeat this algorithm till we remain with 
a single point. This algorithm performs m = O(logn) iteration. Now, the union P = UjP, 
forms a spanning graph of P, and we return any spanning tree T of (P, P). 

The crossing number of T is bounded by the total crossing numbers of the graphs G\ = 
(Pi, Pi), . . . , G m = (P m , F m ). Now, the graph Gi has crossing number 0{U + logn/ log logn) 
by Lemma 12 .41 By Lemma 12.21 U < 2t, for all i. As such, the crossing number of T is 

0(^2^ + logn/ log logn) j = 0{t logn + log 2 n/ log logn). ■ 
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When P is a set of points in R d , we will be interested in the spanning tree having the 
minimal number of crossings with any hyperplane. In particular, the above result implies 
the following. 

Corollary 2.6 Let P be a set of n points in R d , and t be the minimum crossing number of 
any spanning tree of P. Then, one can compute, in polynomial time, a spanning tree T of 
P with a crossing number A = 0(t\ogn + log 2 n/ log log n). Specifically, any hyperplane in 
R d crosses at most A edges (i.e., segments) ofT. 



3 Spanning tree in H d with low crossing number 



Let P be a set of n points in the plane in general position (i.e., no three points are colinear). 
Let £j(P) denote the set of all partitions of P into two non-empty sets, by a line that does 
not contain any point of P. For each such partition, we select a representative line that 
realizes this partition. We slightly abuse notations as refers to as a set of these lines. 

We are interested in the question of finding a spanning tree T of P such that each line 
of &(P) crosses at most 0(y/n) edges of T. 

Definition 3.1 For a set of lines L in the plane, the crossing distance between two points 
is the number of lines of L crossed by the segment formed by these two points. Formally, 
for any two points p,q 6 R 2 , the crossing distance between them is dz,(p, q) = x + y/2, 
where x is the number of lines of L having p and q on opposite sides, and y is the number 
of lines that contain either p or q. It is easy to verify that rfi(-) complies with the triangle 
inequality (as such its a pseudo-metric). 

The crossing disk of radius r centered at a point p, is the set of all vertices of the 
arrangement A(L) in crossing distance at most r from p. We denote this "disk" by D^(p, r). 

We need the following lemma due to Welzl |Wel92| . 



Lemma 3.2 ( |Wel92] ) Let r > be a parameter, L be a set of lines (of size at least 
2r) in the plane, and let p be a point in the plane not contained in any line of L. Then 
|DL(p,r)|>Ct 1 )- 

Here is the LP (jT]) specialized for this planar case, and its dual LP. 



7 ; (P, t) = max x pq 




pqSE(P) 




S ,t , ^^^^^ 3u pq t 


W e £(P) 


pqGE(P), 








J2 x ™ - 1 


Vp G P 


qeP.q^P 




X pq > 


Vpq G E (P) . 



a'(P, t) = min 


t J2 








psp 


s.t. ^ z t 




> 1 


<?e£(P), 




Vpq G E (P) 


zt>0 




W G L(P) 


z p >0 




Vp G P. 
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We will next show that 7'(P, y/n) is feasible. This would imply that one can find spanning 
graph of P with crossing number of 0(y/n) that uses a constant fraction of the vertices (i.e., 
Lemma 12.4ft . 

Lemma 3.3 The LP V(P, t) is feasible for t = y/n. 

Proof: Consider the dual LP above and observe that it is always feasible (for example by 
setting zi — 1 for all £ G £(P) and z p = for all p G P). Thus, if we show that a'(P,t) is 
bounded from below (and thus is finite), then the strong duality theorem would imply that 
7'(P, t) is feasible and equal to a'(P,t). 

So consider a solution to this dual LP, where all the values are rational numbers. Let 
U > 1 be the smallest integer such that if we scale all the values in the given LP solution by 
U then they are integers. In particular, let ye = Uze, for all £ G £(P), and y p = Uz p , for all 
pGP. 

Let L be a set of lines, where we pick yi copies of t into this set, for all t G £(P). 
Formally, -0 copies of the same line I (put into L) will be a collection of i/j, almost identical, 
copies of the line £ slightly perturbed so that these ip lines are in general position. Thus, L 
is a set of N — U X]£g£(p) z £ l mes m general position. Furthermore, the inequalities in the 
LP implies that, for any segment pq G E(P), we have that pq crosses 

<*£(p,q)= ^2 yt = u z e> u ( 1 + z P + z q) = u + yp + yq 

lines of L. 

Observe that Di(p,y p ) n Di(q,y q ) = for any pair pq G E(P). Otherwise, there would 
be a point r in the plane such that di(p, r) < y p and dz,(q, r) < y q . But the triangle inequality 
would imply that dz,(p, q) < y p + y q , which contradicts the above. 

By Lemma [372l for p G P, the disk D £ (p,r) contains at least ( 2/p ^ 1 ) distinct vertices of 
A(L). On the other hand, the total number of vertices in the arrangement A(L) is (^). We 
conclude that 

pGP pGP V / \ / \^ eL / pe p \^ eL / 

Now, by the Cauchy-Schwarz inequality and the above, we have that 



pgp y pgp teL 

As t = \/n this implies that a'(n, t) = \/nJ2eeL z t ~ S P eP z p — ®- ^he claim now follows. ■ 

Remark 3.4 The proof of Lemma [331 works also in higher dimensions, where we consider 
points and hyperplanes in TR d . There, one has to use Holder's inequality instead of the 
Cauchy-Schwarz inequality. Then, the LP is feasible for t = 0(n 1_1 / d ). 
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Theorem 3.5 Given a set P of n points in R d ; one can compute (in polynomial time) a 
spanning tree T of P with crossing number at most 0(n 1_1//d ); that is, any hyperplane in IR d 
crosses at most 0{n 1 ~ 1 ' d \ edges ofT. 

Proof: By Lemma 1331 and Remark 13.41 r /(P,t) is feasible, for t = 0{n l ~ l l d } . As such, by 
Lemma 12.41 we can compute a set of edges F that engages a constant fraction of the points 
of P, and it has crossing number 0(t + logn/ log log n) = 0(t). Using the algorithm of The- 
orem [23] generates a spanning tree with crossing number C{n) = 0(n l ~ l l d ^} + C((19/20)n). 
Namely, the resulting spanning tree has crossing number 0(n 1_1 ' d ). ■ 

Remark 3.6 (Connection to separating/hitting and packing LPs.) It is interesting 
to consider the LP that just tries to separate all points of P from each other. It looks similar 
to our dual LP while being simpler. 



sep(P) = min 


z * 




s.t. ^2 z t 


> 1 


Vpq G E (P) 














zt>0 




W G L{P) 


z p >0 




Vp G P. 



Now, a similar scaling argument to the one used in the proof of Lemma [3.31 implies that 
5^g£(p) zi > y/n/2. Namely, any fractional set of lines separating n points in the plane is 
of size fl(y/n). Naturally, this argument works also in higher dimensions, where a fractional 
set of hyperplanes of size f2(n 1-1 ^) is required to separate a set of n points in R d . 

Observe, that since the LP a'(j, t) is more restrictive than this LP, we conclude that for 
any feasible solution to 0/(7, t) it holds that X^e£(P) z t — vW2- 

The dual to the above LP is the packing LP that tries to pick as many fractional edges 
as possible, while no line crosses edges with total value exceeding 1. While this is similar to 
our primal LP j'(P,t), it is not clear how to round it, since we do not have the guarantee 
that every point has sufficient number of edges attached to it in the fractional solution. 

3.1 A Deterministic algorithm for the planar case 



Interestingly, at least in the planar case, 
one can do the rounding deterministically. 

Lemma 3.7 Let P be a set ofn points in the 
plane and a parameter t, such that 7'(P, t) 
is feasible. Then, one can compute, in poly- 
nomial deterministic time, a set of edges F, 
such that (i) the crossing number of F is 
< 12t, and (ii) the number of connected com- 
ponents in (P, F) is < (3/4)n. 



min || p — q 


x pp (2) 


pqGE(P) 




S,t, ^^^^^ *-^pq t 


W G £(P) 


pqeE(P), 




pqn£^0 




Yl x ™ - 1 


VpGP (*) 


qGPq^p 




X pq > 


Vpq G E(P). 



Figure 1: The modified LP 




Proof: Instead of computing p = ^y'(P,t) we slightly modify the LP so that it finds the 
"shortest" such solution. The resulting modified LP is depicted in Figure CD 

Let H be the set of all the edges pq in the solution to this LP such that 
x pq > 0. We claim that this set of edges is planar. Indeed, if two such 
segments e and e' intersect, then consider two opposing edges / and /' of 
the quadrant formed by the convex hull of the endpoints of e and e', see 
figure on the right. 

We have that ||/|| + ||/'|| < ||e|| + ||e'||, which implies that, for 5 > sufficiently small, 
the solution x e = x e — 5, x e > = x e i — 5,Xf = Xf + 5,xp = Xp + 5 is feasible, as the total 
value of the edges attached to a vertex does not change, and the crossing number of any 
line does not increase by this change, as can be easily verified. But this implies that there 
is a feasible solution with a better target value (specifically, the target value goes down by 
<5 ■ (||e|| + ||e'|| - H/ll - \\f'\\)). A contradiction. 

Thus G = (P, H) is a planar graph where each point of P has at least one edge attached 
to it. Furthermore, the average degree in a planar graph is at most 6, which implies that at 
least half of the points of P have degree at most 12 in G. But each such point p, must have 
an edge pq attached to it, such that x pq > 1/12, because of (*) in the LP ((2j). 

Thus, scale the LP solution by a factor of 12 and pick all the edges pq with 12x pq > 1 into 
the set F. We get a set that is adjacent to at least half of the points of P, and its crossing 
number is at most 12t. ■ 

The planarity argument in the above proof of Lemma 13.71 is similar to the one used by 
Fekete et at. |FLM08| - they use it to argue that there is one heavy edge, while we use it to 
argue that there are many heavy edges. 

Theorem 3.8 Let P be a set of n points in the plane. One can compute, in deterministic 
polynomial time, a spanning tree T of P with a crossing number 0(min(t logn, \/nf), where 
t is the minimum crossing number of any spanning tree of P. 



4 Conclusions 

We presented an approximation algorithm for computing a spanning tree with low cross- 
ing number. The new algorithm relies on a natural LP relaxation of the problem and a 
straightforward rounding scheme. 

Interestingly, our approach enables us to provide a direct proof to the existence of such 
spanning trees in R d . This is, as far as we know, the first algorithm for this problem that 
avoids using the reweighting technique. Intuitively, our algorithm (together with previous 
results |Lon01| ) suggests that reweighting in geometric settings can sometimes be replaced by 
LP rounding. This is a significant feature, as LPs are considerably more general and flexible 
tool than reweighting. For example, using our algorithm, we can add other constraints to 
the LP; e.g., we can insist that some certain cuts would have significantly lower crossing 
number than some other cuts. In particular, it is not clear how one can incorporate such 
considerations into a reweighting algorithm computing spanning trees with low crossing 
number. 
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One interesting open problem, is to compute spanning trees with relative crossing number 
using the new LP approach. Here, given a point set P in IR 3 , one would like to compute a 
spanning tree T such that if a halfspace h + contains k points of P then it boundary plane 
h crosses (say) 0((k\ogn) 2 ^ 3 ) edges of T. Such a result is known in the plane |AHS07| . but 
the problem is open in higher dimensions. 
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